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Let  X  ^  0  be  a  finite  collection  of  nonempty  relations  over  the  rela¬ 
tion  scheme  R(Ai,  A 2, An);  then  the  closure  of  X  under  embedding 
and  direct  product  (up  to  isomorphism)  is  a  finitely  generated  lm- 
plicational  Dependency  family  (ID-family)  generated  by  X.  In  this 
paper,  we  show  that  the  class  of  finitely  generated  ID-families  is  iden¬ 
tical  to  the  class  of  those  ID-families  which  possess  a  finite  Armstrong 
relation. 

C-\ 

1  Introduction 


Data  dependencies  such  as  functional  dependencies  (FDs),  multivalued  depen¬ 
dencies  (MVDs),  and  join  dependencies  (JDs)  have  played  an  important  role 
in  the  design  of  databases['2][3].  In  addition,  they  have  been  used  as  integrity 
constraints  in  an  integrity-checking  mechanism[3].  The  legal  databases  are 

‘This  research  was  supported  in  part  by  U  S.  Army  Research  Office  under  grant 
#DAAL03-87-G-0004. 
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those  which  obey  the  constraints  specified  by  the  database  administrator 
originally.  Consequently,  we  are  interested  in  studying  families  of  instances 
characterized  by  a  given  set  of  dependencies  such  as  FDs,  MVDs,  etc. 

The  class  of  Implicational  Dependencies  (IDs)  was  defined  by  Fagin[2]  as 
the  logical  generalization  of  the  previously  defined  class  of  full  dependencies. 
Properties  of  ID-families  are  mainly  studied  in  [2]. [4], [5], [7],  in  particular,  it 
is  shown  that  the  collection  of  ID-families  is  closed  under  join  and  projection. 

In[5],  it  is  shown  that  a  collection  of  relations  over  scheme  R(A\.  A-> . in) 

is  axicmatizable  by  IDs  if  and  only  if  it  contains  a  trivial  database  and  it  is 
domain  independent  and  closed  under  embedding  and  direct  products. 

In  this  paper,  we  use  the  above  result  to  establish  that  the  collection 
of  ID-families  with  a  finite  Armstrong  relation  and  the  collection  of  finitely 
generated  ID-families  are  identical. 

Vardi[8]  has  established  a  finite  set  of  IDs  with  no  finite  Armstrong  re¬ 
lation.  This,  together  with  the  above  result,  implies  that  finitely  specifiable 
ID-families  are  not  finitely  generated. 

2  Preliminaries 

In  this  paper,  we  assume  readers  to  be  familiar  with  [2],  and  [5].  We  will 
follow  the  notation  of  [2].  In  addition,  throughout  this  paper  we  only  deal 
with  scheme  R(Ai,  A2, ...,  A„). 

Following  Fagin[2],  we  define  an  Implicational  Dependency  (ID)  to  be  a 
typed  sentence  a  of  the  form  wr:iV;r2.,.Va:m(ai  Aa2...  Aq^  — +  0),  where  each  a, 
is  an  atomic  formula  of  the  fo  m  R(yi,  y2,  •••,  yn)  and  $  is  an  atomic  formula 
of  the  form  R{yu  y2, ...,  t/„)  v  =  </,,  where  yd  €  {*i,  x2, ...,  xm}.  We  also 
assume  that  k  >  I  and  each  i,  occurs  in  some  ar  For  example,  the  formula 
VaV6VciVc2Vd1Vd2/i(a,  6,  C\,  d\ )  A  R(a,  b,  c2,  d2)  — ♦  Ci  =  c2)  represents  the 
FD  AB  — >  C  for  the  4-ary  relation  scheme  R(A,  B,C,  D),  and  the  formula 
VuV6iV62VciVc2/?(a,61,Ci)  A  /?(a,62,c2)  — >  /?(a,6t,c2)  represents  the  MVD 
A  — *— ♦  B  for  the  3-ary  relation  scheme  R{A,  B,C). 


Let  r  and  s  be  relations  for  R  (  our  relations  are  all  finite  relations  ),  then  — — 

we  define  the  direct  product  of  r  and  s,  in  notation  r  x  s,  to  be  the  set  of  all 

r  GpA*I 

tuples  t  ( ( 1 1 1  ?  t’21  )i  (^  12?  ^22  ),.**,(  I  ln>  ^2n  )  )  SUCh  that  t\  —  (  ^  1 1 .  ^  12i  ^  In )  1  TAB 

and  t2  —  (^2i)  ^22?  •••)  ^2n)  €  s.  For  example,  the  direct  product  of  the  first  .ooced 
two  relations  in  the  following  diagram  is  the  third  relation.  'i»a.tlon_ 


By 

iDlstr 

ibut 1  on/ 

|  Availability  Codes 

Dist 

O'1 

Avail  emd/or 
Special 

2 


r 

A 

B 

c 

a 

b 

c 

a t 

bt 

CJ 

! 

A 

B 

C 

ax 

6, 

Cl 

ai 

b2 

; 

! 

i 

C2 

r  x  s 


(a, a i) 

(b,bi) 

(c,Cl) 

{a,a2) 

(bM)  ' 

(c,C2) 

{a>,  a i) 

(b/,bi) 

(c/,Ci) 

(a/,  a2) 

(bf,b2) 

! 

(c/,c2) 

The  direct  product  of  rx  x  r2  x  ...  x  rm  is  defined  as  usual.  Also,  we 
define  Dom(r)  to  be  DornT{Ax)  x  DomT(A2 )  x  ...  x  Domr(An),  where  each 
Domr(A ,)  is  the  set  of  all  the  ith  coordinates  of  r.  For  example,  the  Dom(r) 
in  the  above  diagram  is: 
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For  the  relation  scheme  R(Ai,...,An),  we  also  assume  a  countably  in- 
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finite  underlying  domain  for  each  /l,  from  which  A ,  takes  its  values.  Let 
r  and  s  be  nonempty  relations  for  R,  then  /  =  (/i,  /2,  •  fn)  is  called  an 
embedding  from  s  to  r  if  /,  is  a  1-1  function  from  Dom,{Ax)  to  DomT(At) 
for  each  i  and  for  any  tuple  (ai,...,an)  £  Dom(s),  then  (ai,...,a„)  Gsiff 
(/i(ai),  72(02), /„(an))  <E  r.  In  fact,  embedding  is  a  typed  1-1  homomor¬ 
phism  between  two  structures.  In  case  such  f  exists,  we  say  s  can  be  embedded 
into  r.  An  embedding  f  is  called  an  isomorphism  if  f  is  onto.  We  will  use  the 
notation  r  =  s  to  show  that  r  and  s  are  isomorphic.  A  subset  s  of  r  is  called 
a  substructure  of  r  if  Dom(s)  D  r  =  s.  It  is  obvious  that  if  s  is  a  substructure 
of  r,  then  the  identity  map  from  Dom(s)  to  Dom(r)  is  an  embedding. 

Let  E  be  a  set  of  IDs,  then  SAT(T,)  is  the  set  of  all  finite  relations 
satisfying  E.  A  nonempty  collection  of  relations  F  is  an  ID-family  if  there 
exists  a  set  E  of  IDs  such  that  F  =  S’AT(E).  In  case  E  is  finite,  we  say  F  is 
finitely  specifiable  ID-family. 

Let  E  be  a  set  of  IDs,  then  E,  =  {<7  |  E  )=  cr},  i.e.  E*  is  the  set  of  all  IDs 
which  logically  follow  from  E.  A  relation  r  is  called  an  Armstrong  relation  if 
all  members  of  E,  are  true  in  r  and  all  other  IDs  are  false  in  r.  Armstrong 
relations  and  their  applications  are  extensively  studied  in  [I],  [2],  and  [6]. 

For  any  collection  K  of  relations,  let 


SK  =  {  r  |  r  can  be  embedded  into  some  member  of  K} 

PK  =  {  r  |  r  =  ri  x  r2  x  ...  x  rn  for  rt  members  of  K} 

The  next  theorem  gives  a  characterization  for  ID-families. 

Theorem  2.1  [5]Let  F  be  a  family  of  relations  for  R,  then  F  is  an  ID-family 
iff: 

(1)  F  is  closed  under  P. 

(2)  F  is  closed  under  S. 

(3)  F  contains  a  singleton. 


We  would  like  to  mention  here  that  Makowsky  and  Vardi[5]  use  the  term 
“subdatabase”  instead  of  “substructure”. 


4 


3  Main  Result 

Let  X  =  {n,  r2, rn}  ^  0  be  a  collection  of  nonempty  relations  for  R, 
then  theorem  2.1  implies  that  SPX  is  an  ID-family  generated  by  X  (note 
that  condition  (3)  is  trivially  satisfied  as  any  tuple  t  in  some  r,  will  form  the 
substructure  {<}  for  r,).  In  case  X  contains  a  single  relation,  we  will  say  SPX 
is  singly  generated. 

The  next  two  lemmas  imply  that  the  collection  of  finitely  generated  ID- 
families  and  the  collection  of  singly  generated  ID-families  are  identical. 

Lemma  3.1  Let  si  and  s2  be  substructures  of  rx  and  r 2  respectively,  then 
Si  x  s2  is  a  substructure  of  rq  x  r2. 

Proof.  Straightforward. 

Lemma  3.2  Let  X  —  {rq,  r2, ....  rm}  be  a  collection  of  nonempty  relations 
for  R,  then  SPX  —  SP{rx  x  r2  x  ...  x  rm}. 

Proof.  Let  t2,t3,...,tm  be  tuples  in  r2,r3,...,rm  respectively.  By  lemma 
3.1,  rq  x  {t2}  x  ...  x  {tm}  is  a  substructure  of  rq  x  r2  x  ...  x  rm.  Now  since 
r i  is  isomorphic  to  rq  x  {t2}  x  ...  x  {<TO},  it  follows  that  rq  is  a  member  of 
SP{rx  x  r2  x  ...  x  rm\.  Similarly,  we  can  show  r ,  €  SP{rx  x  r2  x  ...  x  rm) 
for  i  =  2,3, ...,  m. 

We  now  establish  a  sequence  of  results  to  prove  our  main  result. 

Lemma  3.3  Let  {F,  |  i  €  1}  be  a  collection  of  ID-families,  then  G  =  (T{Fi  I 
i  €  /}  is  an  ID- family. 

Proof.  Since  singleton  relations  satisfy  all  IDs,  it  is  clear  that  G  ^  I.  To 
prove  the  lemma,  we  will  use  theorem  2.1.  Let  rx,r2  €  G ,  then  rx,r2  €  F,  for 
each  i.  Therefore,  x  r2  €  F,  for  each  i.  Hence,  rq  x  r2  €  G  and  G  is  closed 
under  products.  Similarly  we  can  prove  that  G  is  closed  under  substructure. 

Definition  3.1  Let  X  be  a  collection  of  relations  over  R,  then  the  smallest 
ID- family  containing  X  is  defined  to  be: 

G(X)  =  niT1  |  X  C  F  and  F  is  an  ID-family  } 
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Lemma  3.3  together  with  the  fact  that  X  C  5.47(0)  implies  that  G'(.V) 
always  exists. 

Theorem  3.1  Let  X  =  {r},  then  SPX  is  an  ID-family  and  r  is  an  Arm¬ 
strong  relation. 

Proof.  Since  SPX  is  closed  under  S  and  P,  then  by  theorem  2.1,  SPX  = 
5.47(E)  for  some  set  of  IDs  E.  The  definition  of  G(X),  smallest  ID-family 
containing  X,  and  theorem  2.1  together  imply  that  SPX  =  G(X). 

Let  T  =  {7  |  r  t=  7  and  7  is  an  ID  },  then  by  definition  of  G(X),  we  have 
SPX  C  SAT(r).  Also,  since  every  member  of  E  is  true  in  r,  we  have  E  C  F 
which  implies  5/47(1")  C  5.47(E).  This  shows  that 


G{X)  =  SPX  =  SAT(T)  =  5.47(E) 

Now  we  show  that  r  is  an  Armstrong  relation  for  E.  It  is  obvious  that 
any  a  which  is  the  logical  consequence  of  E  is  true  in  r.  Suppose  a  is  not  the 
logical  consequence  of  S,  then  there  exists  a  relation  s  6  5.47(E)  such  that 
a  is  false  in  s.  Now,  if  a  is  true  in  r,  then  a  will  be  a  member  of  T.  But  this 
is  a  contradiction  since  s  €  5.47(E)  =  SAT(T). 

Finally,  we  show  that  the  collection  of  finitely  generated  ID-families  is  the 
same  as  the  collection  of  ID-families  possessing  a  finite  Armstrong  relation. 

Theorem  3.2  The  collection  of  finitely  generated  ID-families  and  the  col¬ 
lection  of  ID-families  possessing  a  finite  Armstrong  relation  are  identical. 

Proof.  By  theorem  3.1  and  lemma  3.2,  finitely  generated  ID-families 
possess  finite  Armstrong  relations.  On  the  other  hand,  suppose  F  possesses 
a  finite  Armstrong  relation  r.  Since  SP{r }  =  5.47(E)  is  the  smallest  ID- 
family  containing  r,  it  follows  that  5.47(E)  C  F.  Now  let  s  €  F  and  suppose 
s  is  not  a  member  of  5.47(E),  then  there  exists  a  a  6  E  which  is  false  in  s. 
Since  r  is  an  Armstrong  relation  for  F,  it  follows  that  a  is  false  in  r.  But  this 
is  a  contradiction  as  r  €  5/47(E).  This  shows  that  F  C  5.47(E). 
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4  Final  remarks 


Let  r  =  {/},  then  F  =  SP{r}  is  the  collection  of  all  singletons  together 
with  0.  F  can  be  axiomatized  by  the  set  of  all  IDs.  In  addition.  F  can  be 
axiomatized  by  the  following  finite  set  of  IDs: 

Vxi...VxnV?/i...Vj/n(/?(ii, x2,  A  /?(yi,j/2,  -.,*/n)  -*•  Xi  =  J/i) 
Vxl...VxnVyl...Vyn(R(xl,x2< ... ,xn )  A  /?(yi,  y2,  -,!/n)  x2  =  y2) 


Vx1...V.rnVf/i...Vi/n( /2(xi ,  x2 . xn)  A  R(yt,  t/2, ....  i/n)  —  xn  =  j/„) 

This  example  motivates  one  to  investigate  the  relationship  between  finitely 
generated  and  finitely  specifiable  ID-families.  Vardi [S]  has  constructed  a  fi 
nite  set  of  IDs  with  no  finite  Armstrong  relation.  This  together  with  theorem 
3.2  shows  that  finitely  specifiable  ID-families  are  not  finitely  generated.  We 
do  not  know  whether  finitely  generated  ID-families  are  finitely  specifiable. 
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